SD-RSIC: Summarization-Driven Deep Remote Sensing Image Captioning

نویسندگان

چکیده

Deep neural networks (DNNs) have been recently found popular for image captioning problems in remote sensing (RS). Existing DNN-based approaches rely on the availability of a training set made up high number RS images with their captions. However, captions may contain redundant information (they can be repetitive or semantically similar to each other), resulting deficiency while learning mapping from domain language domain. To overcome this limitation, article, we present novel summarization-driven (SD-RSIC) approach. The proposed approach consists three main steps. first step obtains standard by jointly exploiting convolutional (CNNs) long short-term memory (LSTM) networks. second step, unlike existing methods, summarizes ground-truth into single caption sequence and eliminates redundancy set. third automatically defines adaptive weights associated combine summarized based semantic content image. This is achieved weighting strategy defined context LSTM Experimental results obtained RSCID, UCM-Captions, Sydney-Captions data sets show effectiveness compared state-of-the-art approaches. code publicly available at https://gitlab.tubit.tu-berlin.de/rsim/SD-RSIC .

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Nonparametric Method for Data-driven Image Captioning

We present a nonparametric density estimation technique for image caption generation. Data-driven matching methods have shown to be effective for a variety of complex problems in Computer Vision. These methods reduce an inference problem for an unknown image to finding an existing labeled image which is semantically similar. However, related approaches for image caption generation (Ordonez et a...

متن کامل

Improving Automatic Image Captioning Using Text Summarization Techniques

This paper presents two different approaches to automatic captioning of geo-tagged images by summarizing multiple web-documents that contain information related to an image’s location: a graph-based and a statistical-based approach. The graph-based method uses text cohesion techniques to identify information relevant to a location. The statistical-based technique relies on different word or nou...

متن کامل

Deep Learning-Based Classification of Remote Sensing Image

Deep Learning networks have sharply increased over the past 10 years, and deep Learning-Based Classification of Remote Sensing Image has attracted extensive interest. We trained a multilayer deep learning network to classify the 8 thousand unlabeled remote sensing images from Internet into the 600 different classes. In order to improve the efficiency, and shorten the experiment time, we also us...

متن کامل

Deep Self-taught Learning for Remote Sensing Image Classification

This paper addresses the land cover classification task for remote sensing images by deep self-taught learning. Our selftaught learning approach learns suitable feature representations of the input data using sparse representation and undercomplete dictionary learning. We propose a deep learning framework which extracts representations in multiple layers and use the output of the deepest layer ...

متن کامل

Knowledge-driven Information Mining in Remote-Sensing Image Archives

attributes Involving a significant amount of high-level reasoning about the meaning and purpose of the objects or scenes depicted (e.g. illegal plantation). They are outside the scope of the current research activity. Figure 2. Logical representation of data and derived attributes extraction of primitive features and the reduction of the resulting data into primitive feature clusters. The primi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Geoscience and Remote Sensing

سال: 2021

ISSN: ['0196-2892', '1558-0644']

DOI: https://doi.org/10.1109/tgrs.2020.3031111